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Abstract. Choo-Siow (2006) proposed a model for the marriage market which allows for 
random identically distributed noise in the preferences of each of the participants. The 
randomness is McFadden-type, which permits an explicit resolution of the equilibrium 
preference probabilities. 

The purpose of this note is to prove uniqueness of the resulting equilibrium marriage 
distribution, and find a representation of it in closed form. This allows us to derive smooth 
dependence of this distribution on exogenous preference and population parameters, and 
establish sign, symmetry, and size of the various substitution effects, facilitating compar- 
ative statics. For example, we show that an increase in the population of men of any given 
type in this model leads to an increase in single men of each type, and a decrease in single 
women of each type. We show that an increase in the number of men of a given type 
increases the equilibrium transfer paid by such men to their spouses, and also increases 
the percentage of men of that type who choose to remain unmarried. While the above 
trends may not seem surprising, the verification of such properties helps to substanti- 
ate the validity of the model. Moreover, we make unexpected predictions which could be 
tested: namely, the percentage change of type i unmarrieds with respect to fluctuations in 
the total number of type j men or women turns out to form a symmetric positive-definite 
matrix r%j = rji in this model, and thus to satisfy bounds such as \m \ < ( r u r n) 1 ^ 2 ■ 

Along the way, we give a new proof for the existence of an equilibrium, based on 
a strictly convex variational principle and a simple estimate, rather than a fixed point 
theorem. Fixed point approaches to the existence part of our result have been explored 
by others [5] [E] [E], but are much more complicated and yield neither uniqueness, nor 
comparative statics, nor an explicit representation of the solution. 
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1. Introduction 

The classic transferable utility framework which Becker used to model the marriage mar- 
ket was augmented by Choo and Siow [7J to allow for the possibility that agents' preferences 
might be only partly determined by observable characteristics, and might therefore include 
a stochastic component depending on unobservable characteristics. The randomness was 
chosen to be McFadden type [19], and spreads the preferences of agents on one side of 
the marriage market over the entire type-distribution of agents on the other, thus yielding 
non-assortative matching, a feature of the marriage market that has long been observed 
empirically [7J. The model is non-parametric and highly tractable; given an observation of 
marriages between different types of agents, there is a simple closed form, 'point-identified' 
expression for the average utility (or 'total gains') generated by each type of marriage. Of 
use to econometricians, demographers, and economic theorists is a solution to the back- 
wards problem: given the total gains to each type of marriage and population data, what 
can be said about possible distributions of marriages corresponding to those parameters? 
Basic questions include existence and uniqueness of such a marital distribution; computing 
comparative statics is even more interesting. How will the marital distribution respond to 
changes in composition of the population? How will it change due to policy shocks whose 
effects can be summarized by revising the total gains parameters? The purpose of the 
present note is to address these questions. 

A priori, it is not evident whether each choice of gains parameters leads to the existence 
of a distribution of marriages which clears the market [6] |12j [8]. Although it is not the 
main point of our paper, we begin by reconfirming that it does. We then go on to show this 
equilibrium is unique, and give an explicit formula for the resulting marriage distribution, 
essentially solving the model completely. This allows us to derive smooth dependence 
of the resulting equilibrium state on the specified population and preference parameters, 
and establish sign, symmetry and bounds on the response of the predicted distribution of 
marriages to changes in each parameter. Although existence of an equilibrium was also 
discussed by Choo, Seitz and Siow [6] and Fox [12] (and by Dagsvik [8] for a related model), 
our proof relies on the reformulation of the problem as a variational minimization, hence is 
simpler than the fixed point argument they suggest. More importantly, it yields the solution 
in closed form. Our uniqueness result is the first concerning this model [5], and is based on 
convexity (in appropriate variables) of the new variational principle formulated in Section 
§[5j The variational technique we use is powerful, because it characterizes the endogenous 
variables of interest as the critical points of a function. Indeed, the competitive equilibrium 
in this model turns out to be realized as the minimum of a strictly convex function (given 
in Theorem [T] below), facilitating its subsequent analysis. 

1.1. Organization. The remainder of this introduction provides further motivation and 
background for the Choo-Siow model. Section $2] details the model and could be skipped 
by readers familiar with the Choo-Siow model, except that it ends with a summary placing 
our results in the context of related literature. Further comments concerning the derivation 
of the model may be found in Appendix |A] Section $3] states our results formally; it is 
followed by a section containing derived statics and one further conjecture. Section $5] 
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proves the existence and uniqueness of equilibria, while $6] is devoted to the comparative 
statics established in Theorem [2j Section $7] establishes the further comparative static 
assertions of Corollary [3] and $H 

1.2. Further remarks. The random component of agent preferences is a salient feature 
of the Choo-Siow model. Due to this randomness, the equilibrium marriage distribution 
predicted by the model will not be positive assortative, even when the observed attributes of 
the agents are one-dimensional. This is consistent with empirical data. Even in experiments 
where the agents are parameterized by ordered types, such as age, observed marital data 
will almost never be genuinely assortative. For example, in any given population it is 
unlikely to be true that the age of the youngest woman married to a 34 year old man 
always exceeds that of the oldest woman married to a 33 year old. Similarly, one always 
finds matches in large populations that pair high with low qualities as measured by any 
given ordered observable characteristic (e.g. income, years of education, etc.). Thus a 
strictly assortative framework fails to explain the presence of, for example, the existence 
of PhD graduates married to high-school drop-outs. 

The classic transferable utility model of the marriage market, introduced by Gary Becker 
PQ, in principle predicts how agents will marry given exogenous preference parameters. 
However, it has seldom been estimated. There are two main obstacles to estimating a model 
of the marriage market. First, equilibrium transfers in modern marriages (except in the case 
of dowries) are not observed. Hence any behavioural model that requires their presence 
in data is not identifiable. Second, real-world agents are described by discrete, multi- 
dimensional, possibly unordered, types. But the classic Becker model predicts positive 
assortative matching under the assumption that agent type is one dimensional, continuous, 
ordered, and that preferences are super-modular. This positive assortative matching is 
limiting, but does ensure that his model predicts a unique marital distribution. 

The Choo-Siow model eliminates the structural assumptions of the classic model. First, 
it is not necessary to observe transfers in order to determine the equilibrium marriage 
distribution generated by the model. In fact, we provide an explicit formula for the equi- 
librium marriage distribution in terms of the derivative of the Legendre transform of a 
known function. Second, the model places no a priori structure on the nature or number 
of types that agents (men and women) can have. This allows consideration of a wide range 
of attributes, like race, religion, level of income, and educational achievements. 

In this more realistic framework, with its lack of structure for the agents' deterministic 
preferences and types, the issue of whether there exists an equilibrium marital distribu- 
tion, and if so whether it is unique, becomes a question of fundamental theoretical and 
econometric significance. The theoretical importance arises from the fact that uniqueness 
of equilibria in two-sided matching problems is usually not better than a generic prop- 
erty, except perhaps in certain convex programming settings like [10] [TT], which include 
continuous Monge-Kantorovich matching [16] [I]. Further, the randomness considered in 
the model below is the commonly used extreme value logit type, thus any result that de- 
scribes properties of the equilibrating matches has potentially wider applicability. The 
econometric importance arises from the fact that models of the marriage market are useful 
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to econometricians only insofar as they make unique predictions of a marital distribution, 
given exogenous preferences. Prom a practical point of view having closed form solutions 
which permit comparative statics may be even more crucial. 



Our presentation emphasizes the stochastic heterogeneity that differentiates the Choo- 
Siow model from classical models. The competitive framework, which uses transfers of 
utility from spouses to equilibrate the market, is explored in detail in Choo-Siow [7] but 
treated here only in It should be noted at the outset that the methods developed here 
also apply to other non-transferable utility models present in the literature. For example, 
Dagsvik [8] develops a model of the marriage market which uses an assignment algorithm 
(deferred acceptance) rather than utility transfers to sort matches, but his equilibrium 
conditions are functionally similar to ours. 

2.1. Setting. What is exogenous in this model are the observed types of men and of 
women, the numbers of men and women of each type in the population, and the total 
gains 7Tjj of marriage between a man of observed type i and a woman of observed type 
j, relative to both partners remaining single. The quantity iVij will not reappear until 
(|7|). On the other hand, individual agents have a utility function that depends on both an 
endogenous deterministic component that captures systematic utility, and an exogenous 
random one that models heterogeneity within the population of each given type. Thus the 
utility accrued by a man of type i and specific identity g who marries a woman of type j 
is assumed to be: 



the case j = represents the utility of remaining single. The deterministic component is 
77™; its endogeneity can be interpreted to reflect the possibility of interspousal transfer, as 
in Choo-Siow [7] and <J7] below. It is set in equilibrium, and depends explicitly on the type 
of the man and the type of the woman, and implicitly on market conditions, i.e. on the 
relative abundance or scarcity of men and women of each different type. The random term 
€ij g depends additionally on the specific identity of the man, but not on the specific identity 
of the woman. Hence a specific thirty-five year old man may have stronger than typical 
(with respect to his age group) attraction for fifty-year old women. But this attraction does 
not depend on whether, for example, the older woman has an especially strong attraction 
to younger men (assuming this latter characteristic is unobservable in the data and hence 
not reflected in j). 

The random term is assumed to have the Gumbel extreme value distribution described 
in Appendix El This distribution was introduced to the economics literature by McFadden 
[19j . It represents a severe simplifying assumption in our model, but has recently relaxed 
in work by Galichon and Salanie [15] which allows heteroskedasticity, for example. Our 
homoskedastic model admits only a single scaling parameter a which measures the degree 
of randomness; its reciprocal can be interpreted as the signal to noise ratio. It is equal to 
unity in the original Choo-Siow model. For illustrative purposes, we will have occasion to 
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allow a to vary and in doing so embed the Choo-Siow model in a one parameter family of 
models that differ by the degree of randomness present in them. 

In contrast with deterministic matching models, agents of a particular type do not have a 
uniform preferred match. This is because their preferences depend on the random variable 
Cijg. Using the Gumbel structure, the probability that a man of type i prefers a woman of 
type j among all other possible marital choices k £ {0, J} is given by 

eX p(IVL) 

(2) Pr(Man of type (i,g) prefers a woman of type j) - 



7 n rrL 

(see Appendix [A] for a derivation). This probability distribution is endogenous, because 
it depends on the various rf^. Note that it does not depend on the specific identity g of 
the man of type i, since the noise is identically distributed for each different g. Yet it is 
possible already to see how the equilibrium marriage output will differ markedly from a 
deterministic one. Whereas in the deterministic case all members of a given type typically 
have the same preferred match, here the preferred matches of type % men are smeared 
across all female types according to the distribution defined by ([2]). The mean and spread 
of the smearing are determined by the endogenous values rf^ and by a, respectively. 

Consider a G [0,oo]. The case a = 1 corresponds to the Choo-Siow model where some 
smearing is present. The case a = corresponds to a deterministic matching model, for 
which there is no smearing. Indeed, as a — > 0, the largest exponentials dominate all others, 
and the probability that a man of type % prefers a woman of type j converges to or 
#argmax{97 m |o<j<J} depending on whether or not rf^ weakly dominates all other preference 
parameters 77^. Conversely, as a — > 00, the stochastic term dominates the utility function, 
and the resulting probability distribution converges to the uniform distribution. In this 
case, there is maximal smearing, as preferences are completely random, constrained only 
by availability of prospective partners to marry. 

Female preferences are also smeared, and the equilibrium marriage distribution is deter- 
mined when rf^ and t}{- are such that the number of desired marriages of each type is the 
same on both sides of the market. 

We now elaborate on the Choo-Siow model. We henceforth fix a = 1; since preferences 
are relative, this normalization can always be attained by rescaling all of the preferences 
in the model. 



2.2. The Choo-Siow model. Suppose we wish to predict the number of marriages be- 
tween men and women of different types. The number of men of type i is denoted mj. The 
number of marriages of type i men to type j women is denoted jjLij . The number of type i 
men (respectively type j women) who choose to remain single is denoted by /ijo (and noj 
respectively). If each man marries his preferred woman, the equality 



(3) 



Pr(Man of type (i,g) prefers a woman of type j) = 
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will be valid, or at least as the population size becomes large, the right hand side of the 
equality converges to the left hand side by the law of large numbers, or the maximum 
likelihood theorem. 

Using equations ©-Q to compute the ratio of the probability that a man of type i 
prefers a woman of type j to the probability that he prefers to remain single, we arrive at 
the following formula: 

(4) fJ-ij = ^/C 

These IxJ equations are in fact quasi-demand equations, because they indicate the number 
of type jjLij marriages that men of type i would like to participate in. Viewing the female 
market cohort as the supply side, there are analogous supply equations. Letting the utility 
acquired by a woman of type j and specific identity h who marries a man of type i be 

(5) Vl jh = 4 ] +^h, 

the above analysis produces I x J supply equations of the form: 

J 



(6) 4 = '—^.r 

e V 0j 

The equilibrium output in the Choo-Siow model is a specification of fiij for all < i < I, 
and all < j < J. This output is obtained by requiring that supply balance demand: 
\x\- = fi™. Under this market-clearing hypothesis, we have the following equation. The 

endogenous parts of the rj™, rftj, t/-q, rj^- are eliminated upon adding them to arrive at 
the definition of an exogenou^l aggregated gains variable ntj associated to each observable 
type of marriage: 



m ._ Vij + 4j - Vm ~ vj 

{<) TTjj •— 2 

Using the market-clearing hypothesis, we may re-write the equilibrium condition fj,™ 
=: fiij in terms of the exogenous variable 7Tjj as follows: 

(8) ^ =e*v. 



Finally, letting LTjj = e Wij , the equilibrium output is given by 
(9) -J±= II,,, 

The equilibrium conditions expressed in equation ([9]) are implicit. They give necessary 
conditions for real numbers faj to be an output of the Choo-Siow model. However, they 
are not sufficient; a secondary set of necessary conditions, population constraints, must 



^See 33 below or Choo and Siow [7] for an explanation in terms of spousal transfers. 
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also be satisfied. Let there be I types of men, and J types of women. The number of men 
of type i is denoted m^, and the number of women of type j is denoted fj. The vector 
whose i th component is rrij, and whose (I + j) th component is fj, is denoted by v. Called 
the population vector, it has (/ + J) components and may also be denoted by [m | /]. 

A specification (/%•) o<;</ of the number of marriages and singles of each type is called a 

o<j<j 

marital distribution. The following population constraints must be satisfied by all marital 
distributions, and are a consequence of the definitions: 

J 

(10) Mio + ^Mij = rrii, 

i=i 

(11) Moj + X^J = 

i=l 

(12) in, > 0. 

Several questions naturally arise. We call these questions the Choo-Siow inverse problem: 

Problem (Choo-Siow inverse problem) Given a gains matrix II = (Iljj) and a pop- 
ulation vector v = [m | f], does there exist a unique marital arrangement generating H? 
In other words, assuming the entries ITj to be non-negative and irii and fj to be strictly 
positive, does exactly one matrix (fJ-ij) with non-negative entries exis$ that satisfies ([9])- 
(I12j) ? Furthermore, can the qualitative dependence of {Hij) on the exogenous parameters 
II = (IIjj) and v = [vj\ be described? 

This problem is important for several reasons. First, the implicit conditions present 
in equation ([9]) are the equilibrium outcome of a competitive market. There are not so 
many realistic environments with finitely many agent types and many commodities which 
are known to generate unique competitive equilibria — except possibly generically. While 
there are generic uniqueness results for matching problems that can be reduced to convex 
programing problems such as Monge-Kantorovich matching, e.g. [16] [I] |10| the sto- 
chastic heterogeneity prevents the equilibrium in our model from being formulated as such. 



Some readers have pointed out that for each realization of the randomness, balancing supply with 
demand amounts to solving a linear program. In this setting, existence of an equilibrium (and its generic 
uniqueness) are well-known [16] [4]. However, instead of balancing supply with demand for each realization, 
the Choo-Siow model is based on balancing expected supply ([4]) with expected demand ([6j|. Since the 
expected supply (or demand) need not correspond to the actual supply (or demand) for any realization of 
the randomness, existence of an equilibrium in the sense prescribed by Choo and Siow is not obvious. Unlike 
a Nash equilibrium, in which no individuals have both the incentive and the opportunity to change their 
marital status, in a Choo-Siow equilibrium this will only be true in some average sense. Never the less, the 
notion has proved useful empirically, and might even be construed to reflect the metastability (as opposed 
to stability) displayed by actual marriage markets. As simple examples show, such equilibria generally 
correspond to interior points in the feasible polytope denned by the population constraints, hence cannot 
be selected by the minimization of any linear function denned on this polytope. Nevertheless, we shall show 
they do exist and are selected by the minimization of a specific strictly convex function introduced below. 
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Instead, stochasticity restores uniqueness without the need for a genericity assumption in 
our model — and indeed in the more general setting studied by Galichon and Salanie [15] . 

Second, an affirmative explicit solution to the Choo-Siow Inverse Problem makes the 
Choo-Siow model useful in econometric analysis. The matrix II is exogenous and unob- 
served in data, but can be point-estimated from an observed marriage distribution. An 
economic or social shock will affect the systematic utilities that agents of various types in- 
cur by marrying agents of various others, and will therefore alter the value of II. This effect 
can be approximated to form an updated matrix of aggregated systematic parameters II . 
Existence and uniqueness guarantee that there will be exactly one marriage distribution 
that results from the shock, making the model predictive. In the same vein, demographers 
are often interested in predicting how marriage distributions will change due to changing 
demographics, i.e. changes in the population vector v. Our closed form solution makes it 
possible to compute the sign and in some cases the magnitude of such changes explicitly. 

Finally, if the model can be shown to admit a unique distribution, the estimated pa- 
rameters iiij are an alternative characterization of the observed marriage distribution. The 
recharacterization is useful because the parameters of the Choo-Siow model have a be- 
havioural interpretation, and are not merely observed data. 

2.3. Summary of progress and related literature. The related local uniqueness ques- 
tion was resolved by Choo and Siow in [7]. However the issue of global uniqueness was 
left open, and posed as an open problem in a subsequent working paper by Siow [23] , 
We resolve this question positively by introducing a variational principle and a change of 
variables which allows us to exploit convexity. The question of existence of (/%) for all 
II = (e Wi j) was addressed in a working paper of Choo, Seitz and Siow [6] by appealing 
to the Tarski fixed point theorem; see also the related results of Fox [12] and Dagsvik 
[8]. However the proofs there are long and involved, whereas the variational proof in the 
present paper is simple and direct and follows from continuity and compactness by way of 
an elementary estimate. Moreover, it leads to an explicit representation of the solution. 
This allows us to rigorously confirm various desirable and intuitive features of Choo-Siow 
matching, whose presence or absence might in principle be used as a test to refute the 
validity of various alternative matching models. Among other results, we show for example 
that an increase in the number of men of a given type increases the equilibrium transfer 
paid by such men to their spouses, while also increasing the percentage of such men who 
choose to remain single. See Theorem [2] below for related statements and more surprising 
conclusions. Independently of us, Galichon and Salanie [H] [15] developed a variational 
approach which extends the Choo-Siow model by allowing for randomness of much more 
general form. From theoretical considerations, they derive a closed-form expression for a 
strictly concave social welfare function governing competitive equilibria in their models, 
and identities relating different sets of endogenous variables under consideration. This 
gives an alternate approach to existence and uniqueness of equilibria in our setting as a 
special case. Apart from its convex analytic nature, their approach is quite distinct from 
ours, and leads to very different insights and interpretations. 
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3. Precise Statement of Results 

In the preceding remarks, the Choo-Siow inverse problem was phrased in terms of finding 
existence and uniqueness of equilibrium \x given exogenous data II and v. As the name 
suggests, it is also useful to think of this problem as one of inverting a function. From this 
point of view, even though II is exogenous, we may prefer instead to consider II as the 
image of a marriage distribution under a transformation that we seek to invert. 

Remark 1 (Incomplete participation). From the market equilibrium point of view, the 
fact that the left hand-side of ([9]) becomes infinite when /i^o or ji^j is equal to zero is 
unproblematic. It means that for no finite value of the exogenous Yi is sufficient to induce 
all the representatives of some type to marry. However from the inverse problem point of 
view, it is necessary to stipulate that (iiQ and noj be strictly positive. 

3.1. Preliminaries. Let us begin with a reformulation of the Choo-Siow inverse problem; 
Siow attributes this reformulation to Angelo Melino. Let /3j := and /3/ +J - := /^oj denote 
the number of unmarried men and women of types i = 1, . . . ,1 and j = 1, . . . , J respec- 
tively. Since the gains matrix ([9]) can be used to express each component mj = /3i/3j + jYLij 
of the marital distribution in terms of these new variables, the population constraints 
(fT0]) - (fTT]) can be reduced to a system 

J 

$ + Y pipi+jnn - "i = o, i < * < i, 

3=1 

I 

(13) 0j +j + ^Pi+jKij ~ "i+j = 0, 1 < j < J- 

i=l 

of (/ + J) quadratic polynomials in the (/ + J) variables {fik}k=i counting the number of 
umarried men and women of each type. 

A solution to this system of equations is a vector of amplitudes (3 that has (/ + J) 
components. Abstractly, its components might be real, complex, or both. The Choo-Siow 
Inverse Problem is equivalent to showing that the polynomial system (|13p has a unique 
solution with real positive amplitudes for all gains matrices n and population vectors 
v = [m | /] with positive components. The full marital distribution satisfying (|9l)- (ll2p 
is then recovered by choosing fiij = (3i(3i + jHij. Our proof is variational. We construct a 
functional E((3) with the property that (3 is a critical point of E — meaning a point where 
E has zero derivative — if and only if (3 satisfies equation (I13D . We then show that E 
has exactly one critical point in the positive orthant (R + ) lJrJ , and give a formula for this 
critical point using the Legendre transform of a related function. 

The main result of this paper is the following theorem, which solves the Choo-Siow 
Inverse problem. 

Theorem 1 (Existence, uniqueness, and explicit representation of a real positive solution). 
If all the entries of Yl = (Hij) are non-negative, and those of v = [m \ f] are strictly 
positive, then precisely one solution (3 of (I13p lies in the positive orthant oflH I+J . Indeed, 
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the solution b := (log/?i, . . . ,log/3/+j) satisfies b = (DH) 1 (z/) = DH*(v) where H{b) and 
H*(v) are smooth strictly convex dual functions on R/ + ^ defined by 

i+J I J 

(14) H(b) := - £ e 2bk + E E ^ h+bl+j 



2 

k=i i=i j=i 



and 



(15) H *{u):= sup (u,b)-H(b). 

Here (•, •) denotes the inner product on R/ +J . 

Remark 2 (Unpopulated types). In case m, = or fj = 0, we simply reformulate the 
problem in fewer than I + J variables, corresponding only to the populated types. This 
reformulation shows the conclusions of Theorem {Jj extend also to population vectors v = 
[m | / ] whose entries are merely non-negative, instead of strictly positive. 

Since each matrix (/x y -) with non- negative entries solving (|U|)- (|12p corresponds to a 
solution (3 of (fT3|) having positive amplitudes = yfflja and f3j + j = ^/JIoJ, this theorem 
gives the sought characterization of (/iy) by IT. Moreover, this characterization facilitates 
computing variations in the marital arrangements in response to changes in the data (II, v\. 

Theorem 2 (Comparative statics). Let the unique solution to the Choo-Siow inverse prob- 
lem with exogenous data IT and v be given by /3(IL,v). Then (a) the percentage change of 
singles fi\ with respect to the population parameter vi turns out to define a symmetric and 
positive definite matrix 



(16) r M 



k . 



here k, £ E {1, . . . , I+J}. This symmetric positive- definiteness implies, among other things, 
the expected monotonicity r^k > 0, the unexpected symmetry r^i = r^, and more subtle 
constraints relating these percentage rates of change and the corresponding substitution 
effects such as \r M \ < y/r kk r ee . 

(b) Additionally we can account for the sign, and in some cases bound the magnitude, 
of each entry of the matrix R = (r^t). To avoid trivialities, assume no column or row of 
II vanishes, so no observable type of individual is compelled to remain single. Then, 

(17) r M < 0, 

if k € {1 ... 1} and £ £ {I + 1 . . . I + J} (or vice versa). Second, if k,£ G {1 . . . I}, then 

ho > 1 / pi \ x f if k ^ I 

^ 2^ + I * )rwn> *" := { 1 otherwise. 

Similarly, (fT8|) also holds if both k, £ G {I +1 ... I + J}. 

These qualitative comparative statics have a simple interpretation. Increased supply of 
any type k of man coaxes more women into marriage (I17h and decreases the number of 
men who wish to marry. The last statement of the theorem says that this decrease is not 
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merely due to the fact that there are more men. Rather, men of type £ ^ k who would have 
chosen marriage under the old regime choose to be single after the shock (|18p . The following 
corollary explains how the additional women are coaxed into marriage: it shows increased 
competition among men leads to larger equilibrium transfers to their female spouses, as is 
further explained in subsection §[7J The corollary asserts not only monotonicity of utility 
transferred by men of type i, but also of the percentage who choose to remain single, as a 
function of their abundance in the population. 

Corollary 3 (Utility transferred and non-participant fraction increase with abundance). 
For all i < I, j < J, and k < I + J , with the hypotheses and notation of Theorem [H 

( 19 ) ^foJ-'ft) > ° 

(20) and > 0. 



du k \ v k . 

4. Derived statics and further conjectures 

It is possible to express many quantities of interest in terms of the symmetric positive- 
definite matrix R = (rjj ) from Theorem [2J whose entries encode the relative change in the 
number of type % individuals who choose not to marry in response to a fluctuation in the 
total number Vj of type j individuals in the population. For example, in the Choo-Siow 
model, the number of marriages fiij of type i men to type j women is given by the geometric 
mean ([9]) of the number of singles of the two given types times the corresponding entry in 
the gains matrix: /ijj = IIy(^o/%) 1/2 = HijPiPj. Since is exogenous, we immediately 
obtain a formula 

/o-n dlog 1 

(21) -^ k -=2 {nk + rk ^ ) 

showing the relative change in the number of type marriages caused by fluctuations 
in the total population of type k individuals is just the average of the relative changes 
rik '■= 20(log @i)/dvk an d rr+j,fc in the numbers of unmarrieds of the corresponding types 
i and j. 

We may also consider fluctuations in the number of singles of type k in response to 
changes in the exogenous gains parameters Uij when the population v of each type of man 
and woman is held fixed. In section [721 the implicit function theorem is used to derive 



dfa nn m k , 0/3, 



A- 



(22) j£r = -Afr+iCip + - 

dUij dui ih'i . j 

for all £ G {1, ... , /}, j 6 {1, . . . , J}, and k € {1, . . . , I + J}, or equivalently 

(23) -M- = -m-^ + rk ^ ) - 

The equation (I22p has an intuitive interpretation. An increase in the total systematic gains 
to an marriage (produced, for example, by an isolated increase in the value of type 
j marriages to type i men, or an isolated decrease in the value of remaining single) has 
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the same effect as decreasing the supply of the men or women of the respective types by a 
proportionate amount, weighted by the geometric mean of the unmarried men and women 
of type i and j. 

Now Theorem[2]shows the the summands above to have opposite signs, so the sign of their 
sum rfci+rk i+j will fluctuate according to market conditions. If k = i or k = I+j however, 
it is natural to conjecture that this sum is positive, in which case (|2ip shows the the number 
of type marriages fly would be an increasing function of the population size of type 

i (and independently, type j) individuals. Similarly f|23|) then asserts the number of type 
i and j singles to be a decreasing function of the total gains Iljj for type marriages. 
This conjecture amounts to requiring each positive diagonal entry r^k in the matrix R to 
dominate each negative entry — r^i in its row or column — a plausible strengthening (whose 
proof, alas, eludes us) of the claim < r^kfu established in Theorem 12(a)- 

4.1. Summary of comparative statics. Before turning to the proof of the theorems 
listed above, let us conclude by recapping our comparative statics. 

• Increasing the men of a given type increases the number of singles of all male types, 
and decreases the number of singles of all female types. 

• Increasing the number of men of a given type increases the transfer they must pay 
to any woman they marry (|19p . 

• The percentage rate of change of unmarried men of type i due to increases in women 
of type j is equal to the percentage rate of change of unmarried women of type j 
due to increases in men of type i. 

• The marital participation rate Y^j=x decreases with an increase in own type m,j. 

5. A New Variational Principle (Proof of Theorem UJ) 

5.1. Variational method: existence of a solution. Consider the function E : ~R I+J — > 
RU {+oo}, defined as follows: 



It diverges to +oo on the coordinate hyperplanes where the (3k vanish, but elsewhere is 
smooth. 

We differentiate and observe that (3 is a critical point of E if and only if holds. Notice 
strict positivity of the components of v = [ m \ f ] implies the corresponding component of a 
solution (3 to (|13p is non- vanishing, hence no solutions occur on the coordinate hyperplanes 
which separate the different orthants. In words, the critical points of E are precisely those 
that satisfy the system of equations we wish to show has a unique real positive root. It 
therefore suffices to show that E(/3) has a unique real positive critical point; for then (j 1 3I> 
admits exactly one real positive solution. Let us show at least one such solution exists, by 
showing E((3) has at least one critical point: namely, its minimum in the positive orthant. 



(24) 




k=i 



i=l j=l 



k=l 
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Claim 4 (Existence of a minimum). If all the entries of II = (Jiij) are non-negative, and 
those of v = [ m \ f ] are strictly positive, the function E(f3) on the positive orthant defined 
by (|24p attains its minimum value. 

Proof. Since E(f3) is continuous, the claim will be established if we show the sublevel set 
B\ := {(3 £ (R + ) /+J | E{(3) < A} is compact for each A G R. Non-negativity of Ikj 
combines with positivity of v\~, /3fc, and the inequality log ft < ft — 1 to yield 

I+J 1 

(25) E{f})> ]T-ft 2 -^(ft-l) 

fc=i 
j I+J 

(26) = ^( /3fc _ Vfc )2_( I/fc _ i)2 + L 

fc=l 

It follows that B\ is bounded away from infinity. Since E{(3) diverges to +oo on the coordi- 
nate hyperplanes, it follows that B\ is also bounded away from the coordinate hyperplanes 
— hence compactly contained in the positive orthant. □ 



5.2. Uniqueness, convexity, and Legendre transforms. With this critical point char- 
acterization of the solution in mind, let us observe for (3 £ R /+t7 in the positive orthant, 
defining 6& := log ft implies E(/3) = H(b) — (v,b), where H{b) is defined in (fT4|l . Since 
the change of variables f3k & R+ 1 — > ^fc = log ft £ R is a diffeomorphism, it follows that 
critical points of H (b) — (z/, b) in the whole space H I+J are in one-to-one correspondence 
with critical points of E(f3) in the positive orthant. 

On the other hand, H(b) is manifestly convex, being a non-negative sum of convex 
exponential functions of the real variables in fact Hjj > shows the Hessian D 2 H{b) 
dominates what it would be in case II = 0, namely the diagonal matrix with positive entries 
diag[2e 2l>1 , . . . , 2e 2bl+J ] along its diagonal. Thus H (6) is strictly convex throughout H I+J , 
and E{(3) = H(b) — {is, b) can admit only one critical point /3 in the positive orthant — the 
minimizer whose existence we have already shown. The solution (3 to (|13p which we seek 
therefore coincides with the unique point at which the maximum is attained. 

This last fact means that b maximizes the right-hand side of the following equation: 

(27) H*{u):= sup (u,b) - H{b) 

sup -E(j3). 

The function H* defined pointwise by the above equation is the Legendre transform or 
convex dual function of H; see Appendix [B] for details. It follows that the solution b 
satisfies v = DH(b). Thus b = DH*(v) by the duality of H and H* . This provides an 
explicit formula for b in terms of the derivative of H*. 
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6. Comparative Statics (Proof of Theorem [2]) 

6.1. Positive definiteness (a). Our representation of the solution in terms of the Le- 
gendre transform of the convex function H can be used to obtain information about the 
derivatives of the solutions with respect to the population parameters v. 

Suppose we wish to know how the number of marriages faj = Hij/3i/3i+j of each type 
varies in response to slight changes in the population vector u, assuming the gains 
matrix II remains fixed. This is easily computed from the percentage rate of change r k i in 
the number /3| of unmarrieds of each type, which is given in terms of the Hessian of either 
(fTD or (fT5D bv 

1 dB 2 

(28) ™ := w k ^i = 2Dl ^ {v) = 2(D2H ^o g ^. M+J) )^ i < k,£ < i + j. 

To see that these equalities hold, observe that the solution f3 is the point where the 
maximum (|15p is attained. The Legendre transform H*{v) of H defined by this maximum is 
manifestly convex, and its smoothness is well-known to follow from the positive-definiteness 
of D 2 H(b) > as in Lemma Moreover b = DH*(DH(b)), whence the maximum (|15p is 
attained at b = DH*(v) and D 2 H(b)' 1 = D 2 H*{DH{b)) = D 2 H*{u) > 0. This positive 
definiteness implies the first half of the Theorem 2. 

6.2. Qualitative characterization of comparative statics (b). To complete our qual- 
itative description of the substitution effects in this section, we apply the following theorem 
from functional analysis to matrices T : R n — > R n . We define the operator norm of such 
a matrix by ||T|L„ := max |T(z;)|/M, where \v\ = (v,v) 1 / 2 denotes the Euclidean norm. 

Theorem 5 (Neumann series for the resolvent of a linear contraction). // ||T|| p <1 for 
T : R n — > R n , the operator (1 — T)" 1 exists and is equal to J2T=o^ k - 

Next, we consider the matrix -D 2 ^(^)|(io g /3 1 ,...,iog/3 /+J )) an d derive properties of its inverse, 
whose entries give the various values of r^i/2. Differentiating the known function H (b) twice 
yields a positive-definite (I + J) x (J + J) matrix which can be factored into the form 



(29) 2R 1 - D 2 H\ b={log/Su . ^ logfSl+j) - A ^ ) 

where A = diag[e &1 , . . . , e bl+J ] = diag[/3], while Aj and Aj are I x I and J x J diagonal 
submatrices whose diagonal entries are all larger than two: 

1 3 

(A/)* = 2 + -2 IWi+j = 1 + 4, 

Pi j= i Pi 

Pi+j i=i p i+j 

Here we have used the fact that the values j3 are critical points and therefore satisfy the 
first order conditions (| 13|) to simplify these diagonal terms. 
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There are determinant and inverse formulae for block matrices which assert [T7] that 



/A/ n\ 4 / (A 7 - nAj^n^ 1 -(A/ - nAj~ 1 n T )~ 1 nAj _1 \ 
[u T Aj) -[-iFArHAj-iFA^nyi (Aj-n^n)- 1 )■ 



The determinant (|30p is positive by (|29p and Theorem [TJ We will now show that the 
eigenvalues of the matrix A(s) = A/ _1 snAj _1 sn T , appearing in ()30p - (|3ip are bounded 
above by 1 and below by —1 for all values of s G [0, 1]. This will have implications respecting 
the signs of the entries of (|3ip . whose (k, €) th entry is in fact equal to PkPe r ke/^ hence shares 
the sign of the change (|17p - (|18p which we desire to estimate. Namely, it will allow us to 
apply Theorem[5]to block entries such as (A/ - nAj _1 n T ) _1 = (1 - A(l))" 1 A/" 1 in (I3T1) . 

Let \ max (s) be the largest eigenvalue of A(s). Then, the smallest eigenvalue of (1 — A(s)) 
is equal to (1 — \ max (s)). We proceed by continuously deforming from s = to s = 1: The 
eigenvalues of (1 — A(0)) are equal to 1, as A(0) is in fact equal to the zero matrix. Since 
det(l— A(s)) > for all s G [0, 1], continuity of X max (s) and the intermediate value theorem 
imply that 1 — X max (s) > for all s, so that X max (l) < 1. Since no row of II vanishes, A(s) 
has positive entries whenever s > 0. The Perron-Probenius theorem therefore implies that 
any negative eigenvalue A of A(l) is bounded by |A| < X max (l). 

Since A has positive entries and 1 1 -A || op < 1, Theorem [5] indicates that the entries of 
(1 — A)^ 1 are all positive — exceeding one on the diagonal. But Pk@e r ke/2 coincides 
with the (k, l) th entry of (1 - A)" 1 diag[/3^/(/3^ + ui), 0j /(/3| + uj)], giving the desired 
inequalities (|18p whenever k,£ G {1, . . . ,/}. The signs of the remaining derivatives (|17p - 
(|18p may be verified by applying the same technique to the three other submatrices present 
in (|3ip . thus completing the proof of Theorem EJa)-(b). 

7. Transfer utilities (Proof of Corollary [3] and subsequent remarks) 

7.1. Varying the population vectors v. Given a specification of LT and v = [mi \ fj], 
the Choo-Siow model predicts a unique vector (3 = [ | /jLqj ] of unmarrieds. Given a 
fixed IT and a fixed /3, the full marriage distribution can then be uniquely recovered. It is 
therefore possible to view /i as a single valued (smooth) function of II and v. By Theorem[2l 
the signs of r^i are independent of II and v and depend only on whether k G {1, . . . , /}, or 
k G {/+ 1, . . . ,I + J}, and likewise for i. It is perhaps useful to visualize these comparative 
statics as the entries of the matrix Dj3 with D^fik := Then, Dj3 is a block matrix 

that is positive in its upper-left and lower-right blocks, and negative in its upper-right and 
lower-left blocks. Schematically, (|17p - (|18p yield 



(30) 




and 



(32) 
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Reverting back to the Choo-Siow notation for unmarrieds and population vectors, we 
have for all k and I: 

(33) ^°>0, %°<0, ^>0, ^<0. 

These basic comparative statics yield qualitative information about other more complex 

quantities of interest. As indicated following equation ((7|), the quantity rffj + r]{j — rf^ — rj^ 

is exogenous, whereas the first two individual summands are separately endogenous and 

determined within the model. In the original formulation of this model, present in [7], our 

endogenous payoff Vij = Vij~ T ij 1S separated into a systematic return fjW presumed to be 

exogenous, and a utility transfer from husband to wife, which is endogenous and set in 

f f 

equilibrium. Similarly, r)\- = f]\ - + t^j . 

In equilibrium ([6]), both of the following equations hold: 

(34) log(/%) - log(/%) = '//; '/u ; '/// • ~ij %■ 

(35) log(/^)-logGM = r)%- V % = rj%-T ij -fj%; 

there is no utility transferred by remaining single. Subtracting one from the other, we see 
that: 

(36) log(^) = 2ry+c ii , 

where Cij = {f\{- — v^- — fj™ + fj^) is exogenous. 

We denote the differentiation operator by / (suppressing the dependence on k). 

Differentiating c^- = (r){- - It^ - fjL - rfi- + ff$) and ([36j) yields: 



a„ I'/ij 'kj ) — * T iJ — ,, 



The inequalities (|33|) now determine the sign of r\j, which depends on the differentiation 
variable u^. Since /x'jo and /ubj have opposite signs, according to Theorem 2, we find 

< 37 > £>»■ 

which means the transfer of type i men to each type of spouse must increase in response 
to an isolated increase in the population of men of type i. This is expected because an 
increase in the number of type % men introduces additional competition for each type of 
women, due to the smearing present in the model. To decrease the number of type i 
men demanding marriage to a particular type of woman to a level that permits one-to-one 
matching requires an increase in the transfer to crowd out some men. 

While in principle the men might re-distribute so that the proportion of married men 
remains the same, our next computation shows this is not the case. We consider the marital 
participation rate of type k individuals, or rather the non-participation rate Sfc(f) := fii/vki 
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defined as the proportion of individuals who choose not to marry. Differentiation yields 



77— = -f v k r kk - 1 
ov k v A k 



k "k - H k 



> ^k + Pi 

according to f|18f) . But this is manifestly positive since the number of singles of type k 
cannot exceed the total number u k of type k individuals. This means, for example, that an 
increase in the total population of type k men increases the percentage of type k men who 
choose to remain unmarried, given a fixed population of women and men of other types 
(and assuming, as always, that the exogenous gains matrix IT remains fixed). It concludes 
the proof of Corollary [3l 

7.2. Varying the gains data II (Proof of (f2~2~j) - ([23j) ). The population vector v is one 
variable of interest. However the function /3 also depends on the gains parameters II ij. 
The complete derivative D^ v ^j3 = [D u (3 \ Dn(3] is an (I + J) x (IJ + I + J) matrix. As 
such there are linear dependencies among its rows and columns. Since the matrix D u /3 
is invertible, its columns are linearly independent and form a basis of the column space. 
Hence, the remaining columns of the complete derivative can be expressed using linear 
combinations of them. The implicit function theorem applied to this problem turns out to 
yield the simple linear relationship (I22D : 

(38) m=~^^^ 

for alH G {1, . . . , /}, j E {1, . . . , J}, and k <E {1, ...,/+ J}. 

Equilibrium (fT3l) coincides with vanishing of the function F(f3, v, H) : ~R,( I+J ) + ( I+J )+( IJ ') — 3 
R /+J defined by 



Fiiy, n) =/3f + Yl PiPi+i^j ~ 1 < * < 1 

(39) Fj (u, n) = + fcPi+i n»i - vi+j , 1 < j < J. 

1=1 

The implicit function theorem stipulates that if the derivative D^Fl^^^ is invertible, 
there is a small neighbourhood around (/3o,fo,Ho) inside which for each (v, Yi) there is a 
unique /3 satisfying equation (fT3j) . and further that j3 depends smoothly on (z/, n). The 
implicit function theorem also provides a formula for the derivative of the implicit function 
j3(v, n). It is obtained by applying the chain-rule to F(f3(v, n), z/, n): 

(40) [D u f3 I D n (3] ^o,n — — [DpF] 1 [D U F I D]jF]p ^ 0t n . 

Since = — 5 k £, and = f3i/3i + j(5n + 5i + j^), the first part of the preceding formula 
yields [DpF]" 1 = D v j3, and the second part then implies (138 j) . Theorem [2] shows D u (3 is 
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invertible, so the hypotheses of the implicit function theorem are globally satisfied and our 
calculations are valid. This concludes the proof of (|22p - ([23l) . 

Appendix A. Derivation of the preference probabilities 

The random variable present in the definition of male and female utility is the Gumbel 
extreme value distribution, introduced to the economics literature by McFadden |19j : 

Definition 6 (Gumbel distribution). A random variable e is Gumbel if it has cumulative 
distribution function F(e) = exp(— exp(— e)). 

Here Pr(e < x) = F(x) gives the probability that the realization of this random variable 
takes a value less x £ R. The corresponding density function is F'(x) = f(x) = exp(— (x + 
exp(— x)). The mean of e is the Euler-Mascheroni constant, which is approximately equal 

2 

to 7 = 0.57. . .. Its variance is equal to 

We now use this distribution to derive the discrete probability distribution @. 

Lemma 7. Suppose a > and rjij € R are constants, while for each choice of j = 
0, . . . , J, the €ij g are independent identically distributed random variables with the Gumbel 
distribution. Then 

exp(^) 

o<fc<J'~ ELoexp(^)' 
Proof. It costs no generality to assume a = 1. Then 

/oo 
de F'(e) n fe ^ j F(?/ ij + e - rj ik ). 
-oo 

This formula follows from Bayes' rule for conditional probability, and independence of the 
various random variables involved. Substituting in the explicit formula for the Gumbel 
distribution from Definition [6] yields 

/oo 
deexp(-(e + exp(-e))n fe ^exp(-exp(7ftfc - rjij - e)). 
-oo 

We make a change of variables by setting t = exp(— e), so de=—dt/t. Evaluating the 
integral in the new variables yields 

/>oo 

P = dt exp(-£)n fc7 y exp(-£exp(77jjt -rjij)) 
Jo 

poo J 

= dt exp(-t ex P(^ ~ Vij)) 



(41) Pr (%- + tijg = max T r]ik + e ikg) - w 



Efe=o ex p(^fe - vn) 

Efc=o exp(r? ifc ) 

as desired. □ 
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Corollary 8 (Expected marital preferences by observed types). Suppose a man with ob- 
servable type i and (unobservable) specific identity g derives utility V^ g = rj^ + creijg from 
being married to a woman of observable type j, independent of her specific identity. If 
cr > 0, r/lj £ R and €ij g are as in Lemma [?| then the probability he prefers a woman of 
type j to all other alternatives in {0, 1, ... , J} is given by ([2]). 

Remark 3 (The Boltzmann / Gibbs distribution). The probabilities which appear in ([2]) 
and (|4ip take the form of the Boltzmann or Gibbs distributions from statistical physics, 
in which the deterministic component n™ of the utility derived plays the role of the energy 
associated with marital state j, while the strength a of the random component plays the role 
of the physical temperature. This connection is also discussed by Galichon and Salanie [H] 

Appendix B. The Legendre Transform 

Here some well-known results pertaining to convexity and the Legendre transform are 
recalled. Let F : R n — > R be a twice continuously differentiable function; F is convex if 
Hess(F) := D 2 F > 0, and strictly convex if the line segment connecting any two points on 
the graph of F lies above the graph. The Legendre transform or convex dual function to 
F(p) is denoted F*() and defined pointwise by: 

(44) F*(q)= su V {q-p-F(p)}. 

Since the supremum of affine functions is convex, it is clear that F*(q) is a convex function. 
Additionally, the following duality result is true: 

Lemma 9 (Legendre duality). Let F € C 2 be strongly convex on R n , meaning Hess(-F) > 
0. Then F* is also twice continuously differentiable. Further, ifq = DF(p), then p=DF*(q). 
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